A Parallel-Vector Algorithm for Rapid Structural Analysis on High-Performance Computers

Authors

  • Olaf O. Storaasli
  • Duc T. Nguyen
  • Tarun K. Agarwal
Abstract

A fast, accurate Choleski method for the solution of symmetric systems of linear equations is presented. This direct method is based on a variable-band storage scheme and takes advantage of column heights to reduce the number of operations in the Choleski factorization. The method employs parallel computation in the outermost DO-loop and vector computation, via the "loop unrolling" technique, in the innermost DO-loop. It avoids computations with zeros outside the column heights and, optionally, with zeros inside the band. The close relationship between the Choleski and Gauss elimination methods is examined, and the minor changes required to convert the Choleski code to a Gauss code for solving non-positive-definite symmetric systems of equations are identified. The results of two large-scale structural analyses performed on supercomputers demonstrate the accuracy and speed of the method.
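
The column-height idea in the abstract can be illustrated with a minimal serial sketch. This is not the authors' code: it assumes plain Python lists in place of their one-dimensional variable-band storage, it omits the parallel outer loop and the unrolled inner loop, and the names `column_first_rows` and `skyline_cholesky` are hypothetical. It only shows how knowing the first nonzero row of each column lets the factorization A = UᵀU skip the zeros above it.

```python
import math


def column_first_rows(a):
    """Row index of the first nonzero entry in each column of the upper triangle.

    The column height of column j is then j - first[j] + 1.
    """
    n = len(a)
    first = []
    for j in range(n):
        i = 0
        while i < j and a[i][j] == 0.0:
            i += 1
        first.append(i)
    return first


def skyline_cholesky(a):
    """Return upper-triangular U with A = U^T U for a symmetric positive-definite A.

    Entries above each column's first nonzero row stay zero in U, so both the
    row loop and the inner dot product start at the column heights instead of 0.
    """
    n = len(a)
    first = column_first_rows(a)
    u = [[0.0] * n for _ in range(n)]
    for j in range(n):                      # one column at a time
        for i in range(first[j], j + 1):    # skip rows above the column height
            s = a[i][j]
            # Only rows inside the heights of BOTH columns i and j contribute.
            for k in range(max(first[i], first[j]), i):
                s -= u[k][i] * u[k][j]
            if i < j:
                u[i][j] = s / u[i][i]
            else:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                u[j][j] = math.sqrt(s)
    return u
```

Calling `skyline_cholesky` on a small symmetric positive-definite matrix given as a list of rows returns the full upper-triangular factor. The `ValueError` branch marks the point where the square root fails for a non-positive-definite matrix, which is the case the abstract addresses by converting the Choleski code to a Gauss code.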

Similar resources

Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers

This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK, used with a support vector machine classifier, is one of the most usable kernel methods and achieves high accuracy in object recognition. The MATLAB Parallel Computing Toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...

Linear Static Structural and Vibration Analysis on High-Performance Computers

Parallel computers offer the opportunity to significantly reduce the computation time necessary to analyze large-scale aerospace structures. This paper presents algorithms developed for and implemented on massively parallel computers, hereafter referred to as Scalable High Performance Computers (SHPC), for the most computationally intensive tasks involved in structural analysis, namely, generat...

High performance computing for wavelet and wavelet packet image coding

The use of high-performance computers for wavelet and wavelet packet based image coding is discussed. After a short description of wavelet and wavelet packet methods, the existing literature concerning vector, parallel, and VLSI wavelet transforms is reviewed. An algorithm for wavelet packet best basis selection on moderately parallel MIMD architectures is then introduced and an impleme...

New Fast Algorithms for First-Order Linear Recurrences on Vector Computers

We examine the performance of parallel algorithms for first-order linear recurrences on vector computers, evaluate them quantitatively on a simple model of vector computers, and propose new fast algorithms. We also report benchmark results for these algorithms on actual vector computers.

A High-Performance FFT Algorithm for Vector Supercomputers

Many traditional algorithms for computing the fast Fourier transform (FFT) on conventional computers are unacceptable for advanced vector and parallel computers because they involve nonunit, power-of-two memory strides. This paper presents a practical technique for computing the fast Fourier transform that completely avoids all such strides and appears to be near-optimal for a variety of curren...

Publication date: 1990